An MBO scheme for clustering and semi-supervised clustering of signed networks

نویسندگان

چکیده

We introduce a principled method for the signed clustering problem, where goal is to partition weighted undirected graph whose edge weights take both positive and negative values, such that edges within same cluster are mostly positive, while spanning across clusters negative. Our relies on graph-based diffuse interface model formulation utilizing Ginzburg-Landau functional, based an adaptation of classic numerical Merriman-Bence-Osher (MBO) scheme minimizing functionals. The proposed object ive function aims minimize total weight inter-cluster positively-weighted edges, maximizing negatively-weighted edges. scales large sparse networks, can be easily adjusted incorporate labelled data information, as often case in context semisupervised learning. tested our number synthetic stochastic block models real-world sets (including financial correlation matrices), obtained promising results compare favourably against state-of-the-art approaches from recent literature.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semi supervised clustering for Text Clustering

ABSTRACT: Based on clustering algorithm Affinity Propagation (AP) I present this paper a semisupervised text clustering algorithm, called Seeds Affinity Propagation (SAP). There are two main contributions in my approach: 1) a similarity metric that captures the structural information of texts, and 2) seed construction method to improve the semisupervised clustering process. To study the perform...

متن کامل

Semi-supervised Clustering on Heterogeneous Information Networks

Semi-supervised clustering on information networks combines both the labeled and unlabeled data sets with an aim to improve the clustering performance. However, the existing semi-supervised clustering methods are all designed for homogeneous networks and do not deal with heterogeneous ones. In this work, we propose a semi-supervised clustering approach to analyze heterogeneous information netwo...

متن کامل

Semi-supervised Clustering

Clustering is an unsupervised learning problem whose objective is to find a partition of the given data. However, a major challenge in clustering is to define an appropriate objective function in order to to find an optimal partition that is useful to the user. To facilitate data clustering, it has been suggested that the user provide some supplementary information about the data (eg. pairwise ...

متن کامل

Semi-Supervised Projected Clustering

Recent studies suggest that projected clusters with extremely low dimensionality exist in many real datasets. A number of projected clustering algorithms have been proposed in the past several years, but few can identify clusters with dimensionality lower than 10% of the total number of dimensions, which are commonly found in some real datasets such as gene expression profiles. In this paper we...

متن کامل

Semi-supervised clustering methods

Cluster analysis methods seek to partition a data set into homogeneous subgroups. It is useful in a wide variety of applications, including document processing and modern genetics. Conventional clustering methods are unsupervised, meaning that there is no outcome variable nor is anything known about the relationship between the observations in the data set. In many situations, however, informat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Communications in Mathematical Sciences

سال: 2021

ISSN: ['1539-6746', '1945-0796']

DOI: https://doi.org/10.4310/cms.2021.v19.n1.a4